Improved Sensitivity of Nucleic Acid Database Searches Using Application-Specific Scoring Matrices

Authors

  • David J. States
  • Warren Gish
  • Stephen F. Altschul
Abstract

Scoring matrices for nucleic acid sequence comparison that are based on models appropriate to the analysis of molecular sequencing errors or biological mutation processes are presented. In mammalian genomes, transition mutations occur significantly more frequently than transversions, and the optimal scoring of sequence alignments based on this substitution model differs from that derived assuming a uniform mutation model. The information from sequence alignments potentially available using an optimal scoring system is compared with that obtained using the BLASTN default scoring. A modified BLAST database search tool allows these, or other explicitly specified scoring matrices, to be utilized in computationally efficient queries of nucleic acid databases with nucleic acid query sequences. Results of searches performed using BLASTN's default score matrix are compared with those using scores based on a mutational model in which transitions are more prevalent than transversions.
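The idea of a scoring matrix derived from a substitution model can be illustrated with a small log-odds calculation. The sketch below is not the authors' procedure or their parameter values, which the abstract does not give. It assumes uniform base frequencies (0.25 each), a chosen fraction of identical aligned positions, and a per-partner ratio between the probability of a transition and that of a single transversion. The function names `is_transition` and `log_odds_matrix` and both parameters are hypothetical, introduced only for illustration.

```python
import math

BASES = "ACGT"
PURINES = {"A", "G"}  # C and T are pyrimidines


def is_transition(a, b):
    """True for purine<->purine or pyrimidine<->pyrimidine substitutions."""
    return a != b and ((a in PURINES) == (b in PURINES))


def log_odds_matrix(identity, ts_tv_ratio):
    """Return log-odds scores (in bits) for every ordered base pair.

    identity     -- assumed probability that an aligned pair is identical
    ts_tv_ratio  -- assumed ratio of the transition probability to the
                    probability of one particular transversion
    Base frequencies are assumed uniform (0.25 each).
    """
    mismatch = 1.0 - identity
    # Each base has one transition partner and two transversion partners,
    # so ts + 2 * tv must equal the total mismatch probability.
    tv = mismatch / (ts_tv_ratio + 2.0)
    ts = ts_tv_ratio * tv
    scores = {}
    for a in BASES:
        for b in BASES:
            if a == b:
                conditional = identity
            elif is_transition(a, b):
                conditional = ts
            else:
                conditional = tv
            # Target frequency q_ab = 0.25 * conditional; background
            # p_a * p_b = 0.0625, so the odds ratio is 4 * conditional.
            scores[(a, b)] = math.log2(4.0 * conditional)
    return scores


if __name__ == "__main__":
    uniform = log_odds_matrix(identity=0.75, ts_tv_ratio=1.0)
    biased = log_odds_matrix(identity=0.75, ts_tv_ratio=4.0)
    for name, matrix in (("uniform", uniform), ("transition-biased", biased)):
        print(name)
        for a in BASES:
            print(a, " ".join(f"{matrix[(a, b)]:6.2f}" for b in BASES))
```

With a ratio of 1 the matrix reduces to a single match score and a single mismatch score, which is the shape of a uniform mutation model. With a ratio above 1, transitions are penalized less than transversions, which is the qualitative difference the abstract describes.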

Similar resources

Sensitivity and Specificity of Nucleic Acid Sequence-Based Amplification Method for Diagnosis of Cutaneous Leishmaniasis

Abstract Background and Objective: The culture and microscopic method is the gold standard for identifying the Leishmania parasite. Molecular methods such as RT-PCR have higher sensitivity and specificity than microscopic methods; however, they are not widely used because of the expensive equipment and the time required. The use of nucleic acid sequence based amplification (NASBA) ...

FOOTER: a web tool for finding mammalian DNA regulatory regions using phylogenetic footprinting

FOOTER is a newly developed algorithm that analyzes homologous mammalian promoter sequences in order to identify transcriptional DNA regulatory 'signals'. FOOTER uses prior knowledge about the binding site preferences of the transcription factors (TFs) in the form of position-specific scoring matrices (PSSMs). The PSSM models are generated from known mammalian binding sites from the TRANSFAC da...

PrediSi: prediction of signal peptides and their cleavage positions

We have developed PrediSi (Prediction of Signal peptides), a new tool for predicting signal peptide sequences and their cleavage positions in bacterial and eukaryotic amino acid sequences. In contrast to previous prediction tools, our new software is especially useful for the analysis of large datasets in real time with high accuracy. PrediSi allows the evaluation of whole proteome datasets, wh...

Strategies for the effective identification of remotely related sequences in multiple PSSM search approach.

Searches using position specific scoring matrices (PSSMs) have been commonly used in remote homology detection procedures such as PSI-BLAST and RPS-BLAST. A PSSM is typically generated using one of the sequences of a family as the reference sequence. In the case of PSI-BLAST searches, the reference sequence is the same as the query. Recently we have shown that searches against the database of multip...

JASPAR: an open-access database for eukaryotic transcription factor binding profiles

The analysis of regulatory regions in genome sequences is strongly based on the detection of potential transcription factor binding sites. The preferred models for representation of transcription factor binding specificity have been termed position-specific scoring matrices. JASPAR is an open-access database of annotated, high-quality, matrix-based transcription factor binding site profiles for...

Publication date: 1991